Performance Prediction by Trace Transformation

نویسنده

  • Celso L. Mendes
چکیده

Performance stability is an essential feature for the widespread adoption of multicomputers. In this paper, we report the preliminary steps of our research in performance prediction and extrapolation. Performance tuning, guided by extrapolation, may help achieve a substantial fraction of peak performance rates across a broader range of applications while providing guidance for code porting. We introduce a methodology for assessing stability of parallel programs, based on stability of the program execution graph, using time perturbation analysis. For programs with stable behavior, we present a model for performance prediction under architecture variations, by transformation of the execution traces with parameters that reeect the diierences in architecture between two systems. We illustrate the use of this transformation with an example of a parallel PDE solver executing on a multicomputer.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Stability and Prediction

This paper presents experimental data from our research on stability of parallel programs and cross-machine performance prediction on multicomputers. We characterize program behavior by an execution graph, obtained from running an instrumented version of the program. We assess program stability using time perturbations, and analyze the resulting execution graphs with an approximation of a graph...

متن کامل

Using Dynamic Branch Behavior for Power-Efficient Instruction Fetch

Power consumption has become an increasing concern in high performance microprocessor design in terms of packaging and cooling cost. The fetch unit including instruction cache contributes a large portion of the total power consumption in the microprocessor. The instruction cache itself suffers some hidden power consumption due to dynamic control flows. Although capturing the dynamic control flo...

متن کامل

Automatic Parallelization - New Approaches to Code Generation, Data Distribution, and Performance Prediction

This paper introduces the Weight Finder, an advanced profiler for Fortran programs,which is based on a von Neumann architecture. Existing Fortran codes are generally too large toanalyze fully in depth with respect to performance tuning. It is the responsibility of the WeightFinder to detect the most important regions of code in the program, as far as execution time isconcerned. ...

متن کامل

Improving Multiple-block Prediction in the Block-based Trace Cache

Multiple-block prediction is emerging as a new and exciting research area. Highly accurate multiple-block predictors are essential for wide instruction fetch mechanisms, that will support future generations of microprocessors. The block-based trace cache is a recent proposal for wide instruction fetch. It aligns and stores instructions at the basic block level instead of at the trace level, thu...

متن کامل

Some results on Haar wavelets matrix through linear algebra

Can we characterize the wavelets through linear transformation? the answer for this question is certainly YES. In this paper we have characterized the Haar wavelet matrix by their linear transformation and proved some theorems on properties of Haar wavelet matrix such as Trace, eigenvalue and eigenvector and diagonalization of a matrix.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993